Abstract

This paper gives a comprehensive treatment of local uniqueness,
asymptotics and numerics for intrinsic means on the circle. It turns out
that local uniqueness as well as rates of convergence are governed by the
distribution near the antipode. In a nutshell, if the distribution there is
locally less than uniform, we have local uniqueness and asymptotic normality
with a rate of $n^{-1/2}$. With increased proximity to the uniform distribution
the rate can be arbitrarily slow, and in the limit, local uniqueness
is lost. Further, we give general distributional conditions, e.g. unimodality,
that ensure global uniqueness. Along the way, we discover that sample
means can occur only at the vertices of a regular polygon, which allows
intrinsic sample means to be computed in linear time from sorted data. This
algorithm is finally applied in a simulation study demonstrating the dependence
of the convergence rates on the behavior of the density at the
antipode.
1 Introduction

The need for statistical analysis of directional data arises in many applications,
be it in the study of wind direction, animal migration or geological crack development.
An overview of and introduction to this field can be found in Mardia
and Jupp (2000). To date, nonparametric inferential techniques employing
the intrinsic mean, i.e. the minimizer of expected squared angular distances,
rest on the assumption that distributions underlying circular data have no mass
in an entire interval opposite to an intrinsic mean. This assumption, however, is
not met by any of the standard distributions for directional data, e.g. Fisher-von
Mises, Bingham, wrapped normal or wrapped Cauchy distributions.

The reason for this mathematical assumption lies in the fact that the intrin- 
sic distance is not differentiable for two antipodal points. However, the central 
limit theorem for intrinsic means on general manifolds derived by Bhattacharya 
and Patrangenaru (2005), cf. also Huckemann (2010), utilizes a Taylor expan- 
sion of the intrinsic variance by differentiating under the integral sign, i.e. by 
differentiating the squared intrinsic distance. In consequence, this has left the 
derivation of a central limit theorem along with convergence rates for circular 
data of most realistic scenarios and distributions an open problem until now. 

Here, we fill this gap and provide a comprehensive solution. In particular,
we show that, if the distribution near the antipode stays below the uniform
distribution, asymptotic normality with the $n^{-1/2}$ rate well known from Euclidean
statistics remains valid; the asymptotic variance, however, increases with proximity
to the uniform distribution. Furthermore, if the distribution at the antipode
differs from the uniform distribution only in higher order, then the asymptotic
rate is accordingly lowered by this power. If and only if the distribution at the
antipode is locally uniform, the intrinsic mean is no longer locally unique. Moreover,
at an antipode of an intrinsic mean, there may never be more mass than
that of the uniform distribution. In particular there can never be a point mass.
For these reasons, sample means are always locally unique and we will see that
they can only occur on the vertices of a regular polygon. Hence from ordered
data, sample means can be computed in linear time. These insights also allow
us to derive general distributional assumptions such as unimodality under which
there is only one local minimizer and hence a unique intrinsic mean.

This last result extends the result of Le (1998) who guarantees uniqueness for 
distributions symmetric with respect to a point, being non-increasing functions 
of the distance to this point, strictly decreasing on a set of positive circular 
measure, cf. also Kaziska and Srivastava (2008), as well as the very general 
result of Afsari (2011) which in our case of the circle, yields uniqueness if the 
distribution is restricted to an open half circle. 

In the following Section 2 we introduce notation and the concepts of intrin- 
sic means and intrinsic variance. Then we derive in Section 3 our first central 
result for the distribution at the antipode and collect consequences concerning 
uniqueness and algorithmic methods. In particular, we study the loci of local 
minimizers and the maximal number of such. Section 4 derives various versions 
of central limit theorems depending on the proximity to the uniform distribu- 
tion near the antipode. In Section 5 we simulate some cases of non-standard 
asymptotics derived before. We conclude with a discussion of our results and 
an outlook to large sample asymptotics of data assumed on general manifolds. 
2 Setup



Throughout this paper we consider the unit circle $S^1$ as the interval $[-\pi,\pi)$ with
the endpoints identified. More precisely, we equip this interval with the topology
generated by the usual topology of $(-\pi,\pi)$ and by all $[-\pi,\epsilon-\pi)\cup(\pi-\epsilon',\pi)$,
$0<\epsilon,\epsilon'<\pi$, a base of neighborhoods of $-\pi$. On $S^1$ we consider random
elements $X_1,\dots,X_n \overset{\text{i.i.d.}}{\sim} X$ mapping from a given probability space $(\Omega,\mathcal{A},\mathbb{P})$
to $[-\pi,\pi)$, and the intrinsic distance $d(\mu,X) := |\mu - X - 2\pi\,\nu_{(X|\mu)}|$ from $X$
to a given point $\mu\in S^1$ with

$$
\nu_{(X|\mu)} = \begin{cases} 1 & \text{if } \mu>0,\ X\in[-\pi,\mu-\pi),\\ -1 & \text{if } \mu<0,\ X\in(\mu+\pi,\pi),\\ 0 & \text{else.}\end{cases}
$$

In this paper, with the antipodal map $\phi: S^1\to S^1$,

$$
\phi(x) = \begin{cases} x-\pi & \text{if } 0\le x<\pi,\\ x+\pi & \text{if } -\pi\le x<0,\end{cases}
$$

the antipodal set $\tilde S := \{\phi(x) : x\in S\}$ of $S\subset S^1$ will play a central role.

Letting $\mathbb{E}$ denote the usual Euclidean expectation, we are looking for minimizers
$\mu\in[-\pi,\pi)$ of

$$
V(\mu) := \mathbb{E}\big(d(\mu,X)^2\big), \quad\text{and}\quad V_n(\mu) := \frac{1}{n}\sum_{j=1}^n d(\mu,X_j)^2 .
$$

Remark 2.1. Note that $V$ and $V_n$, being continuous functions on the compact
$S^1$, always feature at least one global minimizer.

Definition 2.2. Every such global minimizer $\mu^*$ of $V$ and $\mu_n$ of $V_n$ is called an
intrinsic population mean and an intrinsic sample mean, respectively. The values
$V(\mu^*)$ and $V_n(\mu_n)$ are called total intrinsic population and sample variance,
respectively.
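
As a minimal illustration of these definitions, the following Python sketch evaluates the intrinsic distance and the sample functional $V_n$. The function names `intrinsic_distance` and `sample_variance_functional` are hypothetical and chosen for this sketch only; angles are assumed to be given in radians.

```python
import math


def intrinsic_distance(mu, x):
    """Arc-length (intrinsic) distance between two angles on the unit circle."""
    d = abs(mu - x) % (2 * math.pi)
    # The shorter of the two arcs joining mu and x.
    return min(d, 2 * math.pi - d)


def sample_variance_functional(mu, xs):
    """V_n(mu): mean of squared intrinsic distances from mu to the sample xs."""
    return sum(intrinsic_distance(mu, x) ** 2 for x in xs) / len(xs)


if __name__ == "__main__":
    # Two points close to the antipode of 0 are close to each other on the circle.
    print(intrinsic_distance(3.0, -3.0))
    print(sample_variance_functional(0.0, [1.0, -1.0]))
```

Taking the shorter arc is equivalent to the case distinction in the definition of $d(\mu,X)$ above, since the correction by $2\pi$ applies exactly when $X$ lies beyond the antipode of $\mu$.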

Obviously, global minimizers need not be unique, and there may be local 
but non-global minimizers, as the examples below will show. 

Denoting the usual average by $\bar X = \frac{1}{n}\sum_{j=1}^n X_j$ we have that $-\pi\le\bar X<\pi$.
Since $V(\mu)\le\mathbb{E}|\mu-X|^2$ with equality for $\mu=0$, the latter functional being
minimized by $\mu=\mathbb{E}X$, we see that $\mu^*=0$ locally minimizing $V$ implies $\mathbb{E}X=0$;
cf. also Kobayashi and Nomizu (1969, Section VIII.9) as well as Karcher (1977).
The classical Euclidean Central Limit Theorem hence gives

$$
\sqrt{n}\,\bar X \xrightarrow{\;D\;} N(0,\sigma^2) \qquad (1)
$$

with the Euclidean variance $\mathrm{Var}(X)=\sigma^2$. If $\mu^*=0$ is also a global minimizer
of $V$, i.e. an intrinsic mean, then the Euclidean variance agrees with the total
intrinsic variance

$$
V(0) = \sigma^2 .
$$

If we were interested whether any other $\mu\in S^1$ were an intrinsic mean, we
could apply a simple rotation of the circle to reduce this to the case $\mu=0$
again. Without loss of generality we can hence restrict our attention to this
special $\mu$; recall that its antipode is $\phi(0)=-\pi$.



3 Local and Global Minimizers 

3.1 The Distribution Near The Antipode 

Here is our first fundamental theorem.

Theorem 3.1. If $\mu^*=0$ locally minimizes $V$, then:

(i) $\mathbb{P}_X(-\pi)=0$, i.e. there is no point mass opposite an intrinsic mean.

(ii) If additionally there is some $\delta>0$ s.t. $\mathbb{P}_X$ restricted to $(-\pi,-\pi+\delta)$
features a continuous density $f$ w.r.t. Lebesgue measure, then $f(-\pi+)=\lim_{\mu\downarrow-\pi}f(\mu)\le\frac{1}{2\pi}$;
similarly $f(\pi-)=\lim_{\mu\uparrow\pi}f(\mu)\le\frac{1}{2\pi}$ for a continuous
density $f$ on $(\pi-\delta,\pi)$. If both $f(-\pi+)<\frac{1}{2\pi}$ and $f(\pi-)<\frac{1}{2\pi}$ then $\mu^*$ is
locally unique.

(iii) In case $f(-\pi)=\frac{1}{2\pi}$, $f$ being continuous in a neighborhood of $-\pi$, assume
that there is some $\delta>0$ s.t. $f$ is $k$-times continuously differentiable
on $[-\pi,-\pi+\delta)$ (i.e. with existing left limits) and $\tilde k$-times continuously
differentiable on $(\pi-\delta,\pi]$ (i.e. with existing right limits), where
$k$ and $\tilde k$ are chosen minimal s.t. $f^{(k)}(-\pi+)=\lim_{\mu\downarrow-\pi}f^{(k)}(\mu)\neq 0$ as
well as $f^{(\tilde k)}(\pi-)=\lim_{\mu\uparrow\pi}f^{(\tilde k)}(\mu)\neq 0$. Then, $f^{(k)}(-\pi+)<0$ as well as
$(-1)^{\tilde k}f^{(\tilde k)}(\pi-)<0$, $\mu^*$ is locally unique, and for $\delta>0$ small enough

$$
\mathbb{P}\{-\pi\le X<-\pi+\delta\} = \frac{\delta}{2\pi} + \frac{\delta^{k+1}}{(k+1)!}\,f^{(k)}(-\pi+) + o(\delta^{k+1}),
$$

as well as

$$
\mathbb{P}\{\pi-\delta< X<\pi\} = \frac{\delta}{2\pi} + \frac{\delta^{\tilde k+1}}{(\tilde k+1)!}\,(-1)^{\tilde k}f^{(\tilde k)}(\pi-) + o(\delta^{\tilde k+1}).
$$

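Proof. For any $\mu>0$, we have

$$
\begin{aligned}
V(\mu)-V(0) &= \int_{-\pi+\mu}^{\pi}(\mu-x)^2\,d\mathbb{P}_X(x) + \int_{-\pi}^{-\pi+\mu}(\mu-x-2\pi)^2\,d\mathbb{P}_X(x) - \int_{-\pi}^{\pi}x^2\,d\mathbb{P}_X(x)\\
&= \int_{-\pi}^{\pi}\big((\mu-x)^2-x^2\big)\,d\mathbb{P}_X(x) + \int_{-\pi}^{-\pi+\mu}\big(-4\pi(\mu-x)+4\pi^2\big)\,d\mathbb{P}_X(x)\\
&= \mu^2 - \underbrace{2\mu\,\mathbb{E}X}_{=0} - 4\pi\int_{-\pi}^{-\pi+\mu}\underbrace{(-\pi+\mu-x)}_{\ge 0}\,d\mathbb{P}_X(x)\\
&\le \mu^2 - 4\pi\mu\,\mathbb{P}_X(-\pi),
\end{aligned}
$$

which for $\mu$ small enough becomes negative if $\mathbb{P}_X(-\pi)>0$, whence (i) follows.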

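Now denote the (shifted) cumulative distribution function (c.d.f.) of $Y=X+\pi$ by

$$
F(\mu) = \mathbb{P}\{0\le Y\le\mu\} = \mathbb{P}\{-\pi\le X\le-\pi+\mu\} \qquad (2)
$$

for $\mu>0$ to obtain

$$
\int_{-\pi}^{-\pi+\mu}(-\pi+\mu-x)\,d\mathbb{P}_X(x) = \int_0^{\mu}(\mu-y)F'(y)\,dy = 0\cdot F(\mu) - \mu F(0) + \int_0^{\mu}F(y)\,dy,
$$

$F'$ being understood in a distributional sense. Noting that $\mu^2 = 4\pi\int_0^{\mu}\frac{x}{2\pi}\,dx$,
where $\frac{x}{2\pi}$ is the (shifted) c.d.f. of the uniform distribution on $[-\pi,\pi)$, and that
$F(0)=0$ by (i), we see that

$$
V(\mu)\ge V(0) \quad\text{iff}\quad \int_0^{\mu}\Big(\frac{x}{2\pi}-F(x)\Big)\,dx \ge 0, \qquad (3)
$$

where equality holds simultaneously, too.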

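Thus, under the assumptions of (ii), we may use a 2nd order Taylor expansion
to obtain for $\mu>0$ small enough

$$
0 \le \int_0^{\mu}\Big(\frac{x}{2\pi}-F(x)\Big)\,dx = \frac{\mu^2}{2}\Big(\frac{1}{2\pi}-F'(0+)\Big) + o(\mu^2),
$$

hence $f(-\pi+)=F'(0+)\le\frac{1}{2\pi}$. If this inequality is strict, $V(\mu)>V(0)$ follows
for $\mu>0$ small enough. With the analogous argument for $\mu<0$ this yields the
assertion in (ii).

Finally, (iii) is obtained by a Taylor expansion as well, namely (for $\mu>0$)

$$
F(\mu) = \frac{\mu}{2\pi} + \frac{\mu^{k+1}}{(k+1)!}\,f^{(k)}(-\pi+) + o(\mu^{k+1}),
$$

which after integration, noting that uniqueness implies a sharp inequality, gives

$$
0 \le \int_0^{\mu}\Big(\frac{x}{2\pi}-F(x)\Big)\,dx = -\frac{\mu^{k+2}}{(k+2)!}\,f^{(k)}(-\pi+) + o(\mu^{k+2}),
$$

whence $f^{(k)}(-\pi+)<0$ follows as well as local uniqueness. The case $\mu<0$ is
again treated analogously. □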



3.2 Consequences for Uniqueness, Loci of Local Minimiz- 
ers and Algorithms 

From Theorem 3.1 we obtain at once a necessary and sufficient condition such
that a minimizer of $V$ is locally unique.

Corollary 3.2. By (3), $V$ is constant on some interval $S\subset S^1$ iff the probability
distribution restricted to the antipodal interval $\tilde S$ has constant density $\frac{1}{2\pi}$ there.
In particular, suppose that $\mu^*\in S^1$ is a local minimizer of $V$. Then $\mu^*$ is locally
unique iff there is no interval $\mu^*\in S\subset S^1$ such that the distribution restricted
to $\tilde S$ is uniform.
The following is a generalization of a result by Rabi Bhattacharya (personal
communication from 2008): 

Corollary 3.3. If the distribution of X has a density w.r.t. Lebesgue measure 
which is composed of finitely many pieces, each being analytic up to the interval 
boundaries, then any local minimizer of V is locally unique unless the density is 
constant on some interval. 

Proof. This follows immediately since an analytic function is constant unless 
one of its derivatives is non-zero. □ 

Proposition 3.4. Consider the distribution of $X$, decomposed into the part $\lambda$
which is absolutely continuous w.r.t. Lebesgue measure, with density $f$, and the
part $\eta$ singular to Lebesgue measure. Let $S_1,\dots,S_k$ be the distinct open arcs
on which $f<\frac{1}{2\pi}$, assume they are all disjoint from $\mathrm{supp}\,\eta$, and that
$\{x\in S^1: f(x)=\frac{1}{2\pi}\}$ is a Lebesgue null-set. Then $X$ has at most $k$ intrinsic means and
every $\tilde S_j$ contains at most one candidate.

Proof. Suppose that $\mu^*=0$ is a local minimizer of $V$. By hypothesis and virtue
of Corollary 3.2 there is an open arc $S_1\ni-\pi$ followed (going into positive $x$-direction)
by a closed arc $T_1$ such that $f(x)<1/(2\pi)$ for $x\in S_1\setminus\{-\pi\}$ and
$f(x)>1/(2\pi)$ for $x\in T_1$ a.e. Let $\tilde S_1=(-\delta',\delta)$ and $\tilde T_1=[\delta,\delta'']$ for some
$0<\delta',\delta$ and $\delta''>\delta$. It suffices to show that no $0<\mu<\delta''$ can be another
minimizer of $V$.

To this end with $F$ from (2) consider

$$
G(\mu) = \frac{\mu}{2\pi}-F(\mu) \quad\text{and}\quad H(\mu) = \int_0^{\mu}G(x)\,dx = \int_0^{\mu}\Big(\frac{x}{2\pi}-F(x)\Big)\,dx,
$$

cf. (3), and recall that $0$ is a minimizer of $V$ iff $H(\mu)\ge 0$. By construction, $G'$
is positive on $(0,\delta)$ and it is negative on $[\delta,\delta'']$ a.e. Hence $G$ is continuous and
$G(0)=0<G(\mu)$ for small $\mu>0$. Hence, $H(\mu)>0$ for all $0<\mu<\delta$. Arguing
again with Corollary 3.2 that there cannot be a minimizer of $V$ if its antipode
carries more density than the uniform density, gives that $\delta\le\mu<\delta''$ cannot be
a minimizer either, completing the proof. □

We note two straightforward consequences. 

Corollary 3.5. 

(i) Every population mean of a unimodal distribution is globally unique. 

(ii) If the distribution of $X$ is composed of $k<\infty$ point masses at distinct
locations, then $V$ has at most $k$ local minimizers, each being locally unique;
for each interval formed by two neighboring point masses there is at most
one local minimizer in the interior of the antipodal interval.

(iii) In particular, any intrinsic sample mean is locally unique.
Curiously, candidates for intrinsic sample means are very easy to obtain from
one another.



Corollary 3.6. For a sample $X_1,\dots,X_n$ the candidates for minimizers of $V_n$
described in Proposition 3.4 and Corollary 3.5 form the vertices of a regular
$n$-polygon. If $(X_1,\dots,X_n)$ is continuously distributed on $(S^1)^n$ then there is
a.s. one and only one intrinsic sample mean.

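Proof. W.l.o.g. assume that the sample is ordered, i.e. $-\pi\le X_1\le\dots\le X_n<\pi$.
We consider for $\mu>0$ the case that $X_i<\mu-\pi<X_{i+1}$ for $1\le i\le n-1$
or $\mu-\pi<X_1$ for $i=0$. Note that equalities $X_i=\mu-\pi$ etc. are excluded by
Theorem 3.1(i); also, observe that $\mu-\pi>X_n$ for $i=n$ cannot lead to a local
minimum at $\mu$. Then, setting the first sum to zero in case $i=0$,

$$
\begin{aligned}
V_n(\mu) &= \frac{1}{n}\Big(\sum_{j=1}^{i}(X_j-\mu+2\pi)^2 + \sum_{j=i+1}^{n}(X_j-\mu)^2\Big)\\
&= \frac{1}{n}\sum_{j=1}^{n}(X_j-\mu)^2 - \frac{4\pi}{n}\sum_{j=1}^{i}(\mu-X_j-\pi). \qquad (4)
\end{aligned}
$$

This is minimal for $\mu^{(i)}=\bar X+\frac{2\pi i}{n}$. However, only if $X_i+\pi<\mu^{(i)}<X_{i+1}+\pi$,
this minimum corresponds to a local minimum of $V_n$. Similarly, for $\mu<0$ and
$X_i<\mu+\pi<X_{i+1}$ for $1\le i\le n-1$ or $X_n<\mu+\pi$ for $i=n$, we get

$$
\begin{aligned}
V_n(\mu) &= \frac{1}{n}\Big(\sum_{j=1}^{i}(X_j-\mu)^2 + \sum_{j=i+1}^{n}(X_j-\mu-2\pi)^2\Big)\\
&= \frac{1}{n}\sum_{j=1}^{n}(X_j-\mu)^2 + \frac{4\pi}{n}\sum_{j=i+1}^{n}(\mu-X_j+\pi), \qquad (5)
\end{aligned}
$$

which is minimal for $\mu^{(i)}=\bar X-\frac{2\pi(n-i)}{n}$. Again, note that $X_1>\mu+\pi$ for $i=0$
cannot lead to a local minimum at $\mu$.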

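With the partial means $\bar X_{\le i}=\frac{1}{i}\sum_{j=1}^{i}X_j$ and $\bar X_{>i}=\frac{1}{n-i}\sum_{j=i+1}^{n}X_j$ (terms with an empty sum being set to zero), we obtain for a local
minimizer $\mu^{(i)}$ that $V_n(\mu^{(i)})=v_{n,i}$ where

$$
v_{n,i} = \frac{1}{n}\sum_{j=1}^{n}(X_j-\bar X)^2 + \begin{cases} -\Big(\dfrac{2\pi i}{n}\Big)^2 + \dfrac{4\pi i}{n}\big(\pi+\bar X_{\le i}-\bar X\big), & \mu^{(i)}>0,\\[2ex] -\Big(\dfrac{2\pi(n-i)}{n}\Big)^2 + \dfrac{4\pi(n-i)}{n}\big(\pi-\bar X_{>i}+\bar X\big), & \mu^{(i)}<0, \end{cases} \qquad (6)
$$

whence $V_n(\mu^{(i)})=V_n(\mu^{(j)})$ for $i\neq j$ implies that there are $a,b,c,d\in\mathbb{Z}$, with
$a\neq 0$ or $b\neq 0$, s.t. $aA_i+bA_j+c\bar X+d\pi=0$, where $A_i$ and $A_j$ denote the partial means
entering $v_{n,i}$ and $v_{n,j}$, respectively; the probability of this is zero
for continuously distributed data. □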

Remark 3.7. That intrinsic sample means of continuous distributions are a.s. 
globally unique has also been observed by Bhattacharya and Patrangenaru (2003, 
Remark 2.6). 
Figure 1: Numerical example for determining a sample's intrinsic mean; see
Remark 3.8: $n=10$ observations $X_1,\dots,X_{10}$ from a wrapped normal distribution
are indicated as vertical strokes at the bottom; the curve depicts their
$V_n(\mu)$; vertical dashed lines give the $\mu^{(i)}$ lying on a regular polygon from the
proof of Corollary 3.6, the corresponding values $v_{n,i}$ from (6) are indicated by
circles; the average $\bar X$ of the data is shown in green, the intrinsic mean in red.

Remark 3.8. Corollary 3.6 has application for algorithmically determining an
intrinsic sample mean: in order to do so, determine the minimizers of $V_n(\mu^{(i)})$
for $\mu^{(i)}=\bar X+\frac{2\pi i}{n} \bmod 2\pi$, $i=1,\dots,n$; this requires as many steps as there
are data points in the sample. In fact, (6) easily leads to an implementation
requiring $O(n)$ time for computing the intrinsic mean(s) of a sorted sample.
An example is shown in Figure 1; note that not all points on the polygon form
candidates $\mu^{(i)}$: if $\bar X>0$ then $\bar X-\frac{2\pi}{n}$ cannot be a minimizer. Also note that
$V_n(\mu^{(i)})$ and $v_{n,i}$ only agree if $\mu^{(i)}$ is a local minimizer.

For the computation of population means, Proposition 3.4 allows for a similar
procedure: compute in each interval with density less than $1/(2\pi)$ the unique
minimizer.
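
The candidate search of Remark 3.8 can be sketched in Python as follows. This is a minimal sketch, not the linear-time implementation mentioned in the remark: it evaluates $V_n$ directly at each of the $n$ polygon vertices and therefore costs $O(n^2)$ operations. The names `wrap`, `v_n` and `intrinsic_sample_means` are hypothetical, and the tolerance `tol` for detecting ties is an assumption of this sketch.

```python
import math


def wrap(angle):
    """Map an angle to its representative in [-pi, pi)."""
    return (angle + math.pi) % (2 * math.pi) - math.pi


def intrinsic_distance(mu, x):
    """Arc-length distance between two angles on the unit circle."""
    d = abs(mu - x) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def v_n(mu, xs):
    """Sample functional V_n(mu): mean squared intrinsic distance."""
    return sum(intrinsic_distance(mu, x) ** 2 for x in xs) / len(xs)


def intrinsic_sample_means(xs, tol=1e-12):
    """Return all minimizers of V_n among the polygon vertices
    wrap(mean + 2*pi*i/n), i = 0, ..., n-1 (direct O(n^2) evaluation)."""
    xs = [wrap(x) for x in xs]          # representatives in [-pi, pi)
    n = len(xs)
    xbar = sum(xs) / n                  # Euclidean average of representatives
    candidates = [wrap(xbar + 2 * math.pi * i / n) for i in range(n)]
    values = [v_n(c, xs) for c in candidates]
    best = min(values)
    return [c for c, v in zip(candidates, values) if v <= best + tol]


if __name__ == "__main__":
    print(intrinsic_sample_means([0.1, 0.2, 0.3]))   # data far from the antipode
    print(intrinsic_sample_means([3.0, -3.0]))       # data clustered near +-pi
```

Evaluating $V_n$ itself at every vertex, rather than the values $v_{n,i}$ from (6), sidesteps the check whether a vertex is a genuine local minimizer, at the price of the quadratic running time.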

Finally, we give an illustration to Corollary 3.2. 

Example 3.9. Suppose that $X$ is uniformly distributed on $[-\pi,-\pi+\delta\pi]\cup[\pi-\delta\pi,\pi)$
with $0<\delta<\frac12$ and total weight $0<\alpha\delta<1$, i.e. $0<\alpha<\delta^{-1}$ giving a
density of $\frac{\alpha}{2\pi}$ near $\pm\pi$, and with a point mass of weight $1-\alpha\delta$ at $0$. Then in case
of $\alpha=1$, by Corollary 3.2, $V(\mu)$ is constant for $-\delta\pi\le\mu\le\delta\pi$, and $[-\delta\pi,\delta\pi]$
is precisely the set of intrinsic means. Moreover for $\alpha>1$, $\{-\alpha\delta\pi,\alpha\delta\pi\}$ is the
set of intrinsic means whereas for $\alpha<1$, $0$ is the unique intrinsic mean.






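Proof. Indeed, for $0\le\mu\le\delta\pi$ we have

$$
\begin{aligned}
V(\mu) &= (1-\alpha\delta)\mu^2 + \frac{\alpha}{2\pi}\Big(\int_{\pi-\delta\pi}^{\pi+\mu}(x-\mu)^2\,dx + \int_{-\pi+\mu}^{-\pi+\delta\pi}(x-\mu)^2\,dx\Big)\\
&= (1-\alpha\delta)\mu^2 + \frac{\alpha}{6\pi}\underbrace{\Big(\pi^3-((1-\delta)\pi-\mu)^3+(-(1-\delta)\pi-\mu)^3+\pi^3\Big)}_{=\,2\pi^3-2(1-\delta)^3\pi^3-2\cdot 3(1-\delta)\pi\mu^2}\\
&= \frac{\alpha}{3}(3\delta-3\delta^2+\delta^3)\pi^2 + (1-\alpha)\mu^2,
\end{aligned}
$$

which is constant in $\mu$ for $\alpha=1$, minimal for $\mu=0$ in case of $\alpha<1$ and minimal
for $\mu=\delta\pi$ in case of $\alpha>1$. On the other hand for $\delta\pi\le\mu\le\pi$ we have

$$
\begin{aligned}
V(\mu) &= (1-\alpha\delta)\mu^2 + \frac{\alpha}{2\pi}\int_{\pi-\delta\pi}^{\pi+\delta\pi}(x-\mu)^2\,dx\\
&= (1-\alpha\delta)\mu^2 + \frac{\alpha}{6\pi}\underbrace{\Big(((1+\delta)\pi-\mu)^3-((1-\delta)\pi-\mu)^3\Big)}_{=\,((1+\delta)^3-(1-\delta)^3)\pi^3-3((1+\delta)^2-(1-\delta)^2)\pi^2\mu+2\cdot 3\delta\pi\mu^2}\\
&= \frac{\alpha}{3}(3\delta+\delta^3)\pi^2 - 2\alpha\delta\pi\,\mu + \mu^2\\
&= V(0) + \delta^2\alpha(1-\alpha)\pi^2 + (\mu-\alpha\delta\pi)^2,
\end{aligned}
$$

which is minimal for $\mu=\alpha\delta\pi$. In case of $\alpha=1$ this minimum agrees with $V(0)$,
in case of $\alpha<1$ it is larger than $V(0)$, and in case of $\alpha>1$ it is smaller than
$V(\delta\pi)$. □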
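
The closed-form expressions obtained in this proof can be checked numerically. The following Python sketch compares them with a midpoint-rule approximation of $V(\mu)$ for the distribution of Example 3.9; the function names `v_closed_form` and `v_numeric` and the number of quadrature nodes `m` are assumptions of this sketch, and symmetry of the distribution about $0$ is used to cover $\mu<0$.

```python
import math


def intrinsic_distance(mu, x):
    """Arc-length distance between two angles on the unit circle."""
    d = abs(mu - x) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def v_closed_form(mu, alpha, delta):
    """V(mu) for Example 3.9, using the two closed forms from the proof."""
    mu = abs(mu)  # the distribution is symmetric about 0
    v0 = alpha / 3 * (3 * delta - 3 * delta**2 + delta**3) * math.pi**2
    if mu <= delta * math.pi:
        return v0 + (1 - alpha) * mu**2
    return (v0 + delta**2 * alpha * (1 - alpha) * math.pi**2
            + (mu - alpha * delta * math.pi) ** 2)


def v_numeric(mu, alpha, delta, m=4000):
    """Midpoint-rule approximation of V(mu): point mass at 0 plus
    density alpha/(2*pi) on the two arcs of length delta*pi next to +-pi."""
    h = delta * math.pi / m
    total = 0.0
    for j in range(m):
        t = (j + 0.5) * h
        total += intrinsic_distance(mu, -math.pi + t) ** 2
        total += intrinsic_distance(mu, math.pi - t) ** 2
    point_mass = (1 - alpha * delta) * intrinsic_distance(mu, 0.0) ** 2
    return point_mass + alpha / (2 * math.pi) * h * total


if __name__ == "__main__":
    for alpha in (0.5, 1.0, 1.5):
        for mu in (0.0, 0.3, 1.0, 2.5):
            print(alpha, mu, v_closed_form(mu, alpha, 0.4), v_numeric(mu, alpha, 0.4))
```

For $\alpha=1$ the printed values coincide for all $|\mu|\le\delta\pi$, which is the plateau of intrinsic means described in the example.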



4 Asymptotics 

The strong law of large numbers established by Ziezold (1977) for minimizers of
squared quasi-metrical distances applied to the circle with its intrinsic metric,
which renders it a compact space for which the sequence of $\mu_n$ necessarily features
an accumulation point, gives the following theorem; cf. also Bhattacharya
and Patrangenaru (2003, Theorem 2.3(b)).

Theorem 4.1. If $\mu^*$ is the unique minimizer of $V$ and $(\mu_n)_{n\in\mathbb{N}}$ a measurable
choice of minimizers of $V_n$, then $\mu_n\to\mu^*$ a.s.

More generally, if $E_n$ denotes the set of intrinsic sample means, and $E$ the
set of intrinsic population means, then

$$
\bigcap_{n=1}^{\infty}\overline{\bigcup_{k=n}^{\infty}E_k} \subset E \quad\text{a.s.} \qquad (7)
$$

We now characterize the asymptotic distribution of $\mu_n$ under similar assumptions
as in Theorem 3.1, though additionally requiring that the locally unique
intrinsic mean is in fact globally unique.
Theorem 4.2. Assume that the distribution of $X$ restricted to some neighborhood
of $-\pi$ features a continuous density $f$, has Euclidean variance $\sigma^2$ and that
$\mu^*=0$ is its unique intrinsic mean. Then the following assertions hold for the
intrinsic sample mean $\mu_n$ of $X_1,\dots,X_n \overset{\text{i.i.d.}}{\sim} X$:

(i) If $f(-\pi)<\frac{1}{2\pi}$ then

$$
\sqrt{n}\,\mu_n \xrightarrow{\;D\;} N\Big(0,\frac{\sigma^2}{(1-2\pi f(-\pi))^2}\Big).
$$

(ii) If $f(-\pi)=\frac{1}{2\pi}$ and if $f$ is $(k-1)$-times continuously differentiable in a
neighborhood $U$ of $-\pi$ with these $k-1$ derivatives vanishing at $-\pi$ while
$f$ is even $k$-times continuously differentiable in $U\setminus\{-\pi\}$ with

$$
0 \neq f^{(k)}(\pi-) = \lim_{\mu\uparrow\pi}f^{(k)}(\mu) = (-1)^k\lim_{\mu\downarrow-\pi}f^{(k)}(\mu) = (-1)^k f^{(k)}(-\pi+) < \infty
$$

then

$$
\sqrt{n}\,\mathrm{sign}(\mu_n)\,|\mu_n|^{k+1} \xrightarrow{\;D\;} N\Big(0,\frac{\sigma^2\,((k+1)!)^2}{(2\pi f^{(k)}(-\pi+))^2}\Big).
$$

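Proof. With the indicator function

$$
\chi_A(X) = \begin{cases} 1 & \text{if } X\in A,\\ 0 & \text{if } X\notin A,\end{cases}
$$

we have that

$$
\nu_{(X|\mu)} = \begin{cases} \chi_{[-\pi,\mu-\pi)}(X) & \text{if } \mu>0,\\ -\chi_{(\mu+\pi,\pi)}(X) & \text{if } \mu<0.\end{cases}
$$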

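In consequence of Theorem 3.1, if $\mu_n$ is an intrinsic sample mean, a.s. none
of the $X_j$ ($j=1,\dots,n$) can be opposite of $\mu_n$. Since the sample mean $\mu_n$
minimizes $V_n(\mu)$, we have hence with the well defined derivative $\frac{d}{d\mu}V_n(\mu_n)$ that

$$
0 = \frac{1}{2}\frac{d}{d\mu}V_n(\mu_n) = \begin{cases} \mu_n-\bar X-\dfrac{2\pi}{n}\sum_{j=1}^{n}\chi_{[-\pi,\mu_n-\pi)}(X_j) & \text{for } \mu_n>0,\\[2ex] \mu_n-\bar X+\dfrac{2\pi}{n}\sum_{j=1}^{n}\chi_{(\mu_n+\pi,\pi)}(X_j) & \text{for } \mu_n<0,\end{cases} \qquad (8)
$$

cf. (4) and (5). Under the assumptions of (i) above let us now compute

$$
\mathbb{E}\big(\chi_{[-\pi,\mu-\pi)}(X)\big) = \int_{-\pi}^{\mu-\pi}f(x)\,dx = \mu f(-\pi)+o(\mu)
$$

in case of $\mu>0$ and similarly

$$
\mathbb{E}\big(\chi_{(\mu+\pi,\pi)}(X)\big) = -\mu f(-\pi)+o(\mu)
$$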
in case of $\mu<0$. In consequence, using that the variance of these Bernoulli
variables is less than or equal to their expectation, we get

$$
\frac{1}{n}\sum_{j=1}^{n}\chi_{[-\pi,\mu-\pi)}(X_j) = \mu f(-\pi)+O_P\big(\sqrt{\mu/n}\big)+o(\mu),
$$

$$
\frac{1}{n}\sum_{j=1}^{n}\chi_{(\mu+\pi,\pi)}(X_j) = -\mu f(-\pi)+O_P\big(\sqrt{-\mu/n}\big)+o(\mu),
$$

for $\mu>0$ and $\mu<0$, respectively, the bounds for $O_P(\sqrt{|\mu|/n})$ being uniform
in $\mu$. In conjunction with (8), and using the SLLN for $\mu_n$ (Theorem 4.1), i.e.
$\mu_n=o_P(1)$, we obtain

$$
\sqrt{n}\,\big((1-2\pi f(-\pi))\mu_n-\bar X\big) = o_P(1).
$$

This gives assertion (i).

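Under the assumptions of (ii) we get, noting that by Theorem 3.1(iii)
$\mathrm{sign}(f^{(k)}(-\pi+))=-1$ while $\mathrm{sign}(f^{(k)}(\pi-))=(-1)^{k+1}$,

$$
\mathbb{E}\big(\chi_{[-\pi,\mu-\pi)}(X)\big) = \frac{\mu}{2\pi}+\frac{\mu^{k+1}}{(k+1)!}\,f^{(k)}(-\pi+)+o(\mu^{k+1}) \quad\text{for } \mu>0,
$$

$$
\mathbb{E}\big(\chi_{(\mu+\pi,\pi)}(X)\big) = -\frac{\mu}{2\pi}+\frac{|\mu|^{k+1}}{(k+1)!}\,f^{(k)}(-\pi+)+o(|\mu|^{k+1}) \quad\text{for } \mu<0.
$$

In consequence, as above, we infer from (8) that

$$
\sqrt{n}\,\Big(\frac{\mathrm{sign}(\mu_n)\,|\mu_n|^{k+1}}{(k+1)!}\,2\pi f^{(k)}(-\pi+)+\bar X\Big) = o_P(1),
$$

which gives assertion (ii). □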

Remark 4.3. We note that under the assumptions in Theorem 4.2, namely
that $f$ differs from the uniform distribution at $-\pi$ for the first time in its $k$-th
derivative there, the convergence rate of $\mu_n$ is precisely $n^{-\frac{1}{2(k+1)}}$.

Comparing with (1), we see that the asymptotic distribution of $\bar X$ is more
concentrated than the one of the intrinsic mean unless $f(-\pi)=0$, the intrinsic
mean exhibiting slower convergence rates than $\bar X$ if $f(-\pi)=\frac{1}{2\pi}$.
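
To make the variance inflation in Theorem 4.2(i) concrete, the following Python sketch computes the median absolute deviation (MAD) of $\mu_n$ predicted by the normal approximation $N\big(0,\sigma^2/(n(1-2\pi f(-\pi))^2)\big)$. The function names `example_variance` and `predicted_mad` are hypothetical; the example values use the distribution of Example 3.9 with $\alpha<1$, for which $\sigma^2=V(0)$ and $f(-\pi)=\alpha/(2\pi)$.

```python
import math
from statistics import NormalDist


def example_variance(alpha, delta):
    """sigma^2 = V(0) for the distribution of Example 3.9."""
    return alpha / 3 * (3 * delta - 3 * delta**2 + delta**3) * math.pi**2


def predicted_mad(n, sigma, f_antipode):
    """MAD of mu_n under the normal approximation of Theorem 4.2(i).

    f_antipode is the density value f(-pi); it must lie strictly below
    the uniform density 1/(2*pi) for this approximation to apply.
    """
    if not 0 <= f_antipode < 1 / (2 * math.pi):
        raise ValueError("Theorem 4.2(i) requires 0 <= f(-pi) < 1/(2*pi)")
    sd = sigma / ((1 - 2 * math.pi * f_antipode) * math.sqrt(n))
    # For a centered normal variable, MAD = sd * (0.75-quantile of N(0, 1)).
    return NormalDist().inv_cdf(0.75) * sd


if __name__ == "__main__":
    alpha, delta = 0.9, 0.4
    sigma = math.sqrt(example_variance(alpha, delta))
    for n in (10**3, 10**4, 10**5):
        print(n, predicted_mad(n, sigma, alpha / (2 * math.pi)))
```

With $f(-\pi)=0$ the prediction reduces to the MAD of the Euclidean mean $\bar X$ from (1); as $f(-\pi)$ approaches $1/(2\pi)$ the factor $1/(1-2\pi f(-\pi))$ grows without bound.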



5 Simulation 

For illustration of the theoretical results we consider here examples exhibiting
different convergence rates: we generalize the density from Example 3.9 to behave
like a polynomial of order $k$ near $\pm\pi$. To be precise, we assume that the
distribution of $X$ is composed of a point mass at $0$ with weight $1-\alpha\delta$ with
$0<\alpha<\delta^{-1}$, $0<\delta<\frac12$ and of a part absolutely continuous w.r.t. Lebesgue
measure with density $g$ where $g(-\pi+x)=g(\pi-x)=f(x)$ for $0\le x\le\pi$, and

$$
f(x) = \begin{cases} \dfrac{\alpha}{2\pi} & \text{for } 0\le|x|\le\delta\pi,\\[1.5ex] 0 & \text{else}\end{cases}
$$
            α      k     δ      color
case 0a     0.9    0     0.4    blue
case 0b     1      0     0.4    red
case 1a     0.9    1     0.4    green
case 1b     1      1     0.4    brown
case 2      1      2     0.4    violet
case 3      1      3     0.4    purple

Table 1: Parameters for the simulations, and respective colors used in Figures 2
and 4.

for $k=0$, while for $k>0$



Note that $f(-\pi)=\frac{\alpha}{2\pi}$ while $\int f(x)\,dx=\alpha\delta$. We simulated several examples
with parameters given in Table 1; the corresponding densities are shown in
Figure 2.

Example 3.9 is the special case for which $k=0$; in particular, for case 0b,
we computed 10,000 intrinsic means, each of which was based on $n=10{,}000$
i.i.d. observations; a histogram of these intrinsic means is shown in Figure 3.
There, the distribution of the intrinsic sample mean for case 0b appears to
be composed of two parts: an essentially constant density over $[-\delta\pi,\delta\pi]$, the
set of the intrinsic population means, and two peaks with their modes located
close to the interval's endpoints. Their presence can be explained as follows:
approximately with probability one half, we observe fewer than $(1-\delta)n$ zeros,
whence there is too little mass at $0$ to keep the intrinsic sample mean in the
interval $[-\delta\pi,\delta\pi]$, but for $n$ large enough there will with large probability still
be many zeros so that the intrinsic sample mean cannot move far away from
that interval. According to Theorem 4.1, these peaks' locations converge to the
interval's boundary when $n\to\infty$. In fact, our simulations suggest that we have
$\bigcap_{n=1}^{\infty}\overline{\bigcup_{k=n}^{\infty}E_k}\cap A\neq\emptyset$ with positive probability for any $A\subset[-\delta\pi,\delta\pi]$ having
non-zero Lebesgue measure.

We also determined the median absolute deviation (MAD) of the intrinsic 
sample means for the different cases in Table 1, and compared them to the 
MAD predicted from the asymptotic distribution given in Theorem 4.2 (except 
Figure 2: Densities of the simulated distributions with parameters in Table 1;
the dotted vertical line in the centre indicates the point mass at 0.



8/71 

7/n 

6/ir - 

5/it 

4/rc 

3/n 

2/u 

1/rc -I 



hi 



Figure 3: Histogram of 10,000 intrinsic means each based on n = 10,000 i.i.d. 
draws from case 0b in Table 1. 
Figure 4: Median absolute deviations of the intrinsic mean of n i.i.d. draws
from the simulated distributions with parameters in Table 1 (cases 0a, 0b, 1a, 1b, 2 and 3),
based on 1,000 repetitions, plotted against n on a log-scale; lines give the values
predicted using the asymptotic distribution.




Figure 5: Q-q-plots of 1,000 intrinsic means, transformed and standardized
according to the asymptotic distribution, each based on n i.i.d. draws from case
1b in Table 1; the dashed line depicts the identity line.






for case 0b where it does not apply), see Figure 4. For this, one easily computes



$$
\sigma^2 = 2\int_0^{\pi}(\pi-x)^2f(x)\,dx = 2\int_0^{\delta\pi}(\pi-x)^2f(x)\,dx + 2\int_{\delta\pi}^{\pi}(\pi-x)^2f(x)\,dx
$$

- - — \f-rr _ r 1 ! 3 !^ - a \ 7f2 T k+1 - J^ T k + 2 + 1 r fc+3ifa 

- 6tt^ 2ir{5n) k[k+1 k+2 k+3 Jo 



n 



r ir 2 (l-2Sf fc+1 , 2*-(l-2<S) fc+2 , _J_„fe+3l(57r 
fct fc+1 x fc+2 x fc+3 x JO 
as well as $f^{(k)}(0+)$ = —a A/K k ^ k . Note that we chose the MAD as it commutes
with the power transforms in Theorem 4.2(ii), as opposed to the standard deviation.
Thus, we found the rates predicted in Remark 4.3, namely $n^{-\frac12}$ if $\alpha<1$
and $n^{-\frac{1}{2(k+1)}}$ if $\alpha=1$ and $k>0$, to match the observed MAD in Figure 4 well.

Furthermore, for case 1a, Figure 5 shows normal q-q-plots for the intrinsic
sample means, transformed and standardized according to their asymptotic distribution,
i.e. for $\sqrt{n}\,\mathrm{sign}(\mu_n)\,|\mu_n|^{k+1}\,2\pi\,|f^{(k)}(-\pi+)|\,\big((k+1)!\,\sigma\big)^{-1}$. Note that there appears to
be a peak at 0, visible from the curve getting almost horizontal there, which
decreases with increasing $n$.



6 Discussion 



Let us conclude with a discussion of our rather comprehensive results on locus,
uniqueness, asymptotics and numerics for intrinsic circular means. In the past,
there has been a fundamental mismatch between distributional and asymptotic
theory on non-Euclidean manifolds. While a great variety of distributions for
circular data had been developed which very well reflect the non-Euclidean
topology, e.g. nowhere vanishing densities, the central limit theorem had only
been available for distributions essentially restricted to a subset of Euclidean
topology. On the circle we have eliminated this mismatch. In particular our
results state that

the more the non-Euclidean topology is reflected by a probability distribution,
i.e. the closer the distribution near the antipode is to the
uniform distribution, the larger the deviation from Euclidean asymptotics.

We expect that similar results are valid for general manifolds where the antipode
needs to be replaced by the cut locus $C(\mu)$. Unless $C(\mu)$ carries positive mass,
$V(\mu)$ is still differentiable at $\mu$, see Pennec (2006). From what we observed for
the circle, we conjecture that $C(\mu)$ cannot carry mass if $\mu$ is a local minimizer.
However, generalizing all results obtained here to arbitrary Riemannian manifolds
is the subject of future research, but note that our results at once carry
over to tori, these being cross products of circles.